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ABSTRACT 



Three-point weak lensing statistics provide cosmic information that complements two-point statistics. However, both statistics suffer 
from intrinsic-shear alignment, which is one of their limiting systematics. The nulling technique is a model-independent method 
developed to eliminate intrinsic-shear alignment at the two-point level. In this paper we demonstrate that the nulling technique can 
also be naturally generalized to the three-point level, thereby controlling the corresponding GGI systematics. 

We show that under the assumption of exact redshift information the intrinsic-shear alignment contamination can be completely 
eliminated. To show how well the nulling technique performs on data with limited redshift information, we apply the nulling technique 
to three-point weak lensing statistics from a fictitious survey analogous to a typical future deep imaging survey, in which the three- 
point intrinsic-shear alignment systematics is generated from a power-law toy model. 

Using 10 redshift bins, the nulling technique leads to a factor of 10 suppression of the GGI/GGG ratio and reduces the bias on cos- 
mological parameters to less than the original statistical error. More detailed redshift information allowing for finer redshift bins leads 
to better bias reduction performance. The information loss during the nulling procedure doubles the statistical error on cosmological 
parameters. A comparison of the nulling technique with an unconditioned compression of the data suggests that part of the informa- 
tion loss can be retained by considering higher order nulling weights during the nulling procedure. A combined analysis of two- and 
three-point statistics confirms that the information contained in them is of comparable size and is complementary, both before and 
after nulling. 

Key words, cosmology: theory - Methods: data analysis - gravitational lensing - large-scale structure of the Universe - cosmological 
parameters 



1. Introduction 

Weak gravitational lensing refers to the mild distortion of the 
light from distant sources by the large-scale matter inhomogene- 
ity between the source and the observer. One observable effect of 
weak gravitational lensing is the coherent shape distortion of the 
light sources, known as cosmic shear. Cosmic shear is sensitive 
to all cosmological parameters which have influence on the den- 
sity perturbations and/or the geometry of the universe, including 
those concerning properties of dark energy, which have been a 
key concern after the discoveries made by observations of super- 
novae, the cosmic microwave background, and t he large-scale 
structure (for a review see e.g. IMunshi et alj|2008l). 

Since its firs t detection in 2000 dBacon et al. 2000; 
Kais er et alJ l2000t IVan Waerbeke et all l2000t IWittman et alJ 
2000), cosmic shear has been developed into a competitive cos- 
mological probe. Its constraining power on c osmological param- 
eters is now co mparable to other probes (e.g. lSpergel et al. 2007: 
IFu et alJl2008h . 

With forthcoming large field multicolor imaging surveys 
(e.g. DESfl KIDSB EUCLIEB etc), photometric redshift and 
shape information of a huge number of galaxies will be avail- 
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able, rendering cosmic shear even greater statistical power. 
In particular, cosmic shear is consider ed to be one of the 
most promising da rk energy probes dAlbrecht et alJ [2006; 
iPeacock et al.i r2006) when the results of these surveys become 
available. 

Such constraining power will be further enhanced by the use 
of higher-order statistics. Second-order statistics measure only 
the Gaussian signature of a random field. Even if the primordial 
cosmic density field is Gaussian, non-Gaussianity will be gener- 
ated due to the nonlinear nature of gravitational clustering. Such 
non-Gaussianity in the cosmic density field will then show up 
via its lensing effect, leading to non-Gaussian signals in the cos- 
mic shear field. A common way of measuring non-Gaussianity 
is to use higher-order statistics. In cosmic shear studies, several 
authors have shown that the lowest order of them, i.e. the third- 
order statistics, already provide a valuable probe for cosmologi- 
cal parameter estimates; in particular it can break the near degen- 
eracy between the density parameter Q m and the power spectrum 
normalization q-r dBernardeau et al.l J 1 997t iJain & Seliaklll997t 
van Waerbeke et alj|1999t lHuilll999l) . A more recent study by 
Taka da & Jainl (120041) (TJ04 afterwards) showed that including 
third-order statistics can improve parameter constraints signifi- 
cantly, typically by a factor of three. 

But the ultimate performance of these future surveys still 
largely depend s on the how well the systematic errors can be 
controlled (e.g. lHuterer et al.ll2006l) . In this paper we focus on a 
particularly worrisome systematic error in cosmic shear studies: 
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the intrinsic-shear alignment, and demonstrate a way to control 
it for shear three-point statistics. 

In the weak lensing limit the observed ellipticity of a galaxy 
e bs can be written as the sum of the intrinsic ellipticity €\ of the 
galaxy, and the shear y which is caused by gravitational lensing 
of the foreground matter distribution. Here e D b s , e\ and y are com- 
plex quantities. Intrinsic-shear alignment is defined in two-point 
cosmic shear statistics as the correlation between the intrinsic 
ellipticity of one galaxy and the shear of another galaxy (the GI 
term. lHirata & Seriakll2004l) . Three-point statistics (e obs f4s e ot>s)' 
a correlator of ellipticities of three galaxy images i, j and k, can 
also be expanded into lensing (GGG), intrinsic-shear (GGI and 
Gil), and intrinsic (III) terms: 



(4 s 44s) = GGG + GGI + Gil + III , with 

GGG = (r'rV) , 

GGI = (4yY) + (4H) + (4Yr j ) . 
Gn = (^/) + (^y) + (^y) , 



in 



(444) • 



(i) 

(2) 
(3) 
(4) 
(5) 



Physically, if one assumes that galaxies are randomly ori- 
ented on the sky, only the desired GGG term remains on the 
right-hand side of (Q}. However, when these galaxies are sub- 
ject to the tidal gravitational force of the same matter structure 
(e.g. they formed under the influence of the same massive dark 
matter halo), their shapes can intrinsically align and become cor- 
related, giving rise to a nonvanishing III term. Furthermore, GGI 
and Gil terms can be generated when a matter structure tidally 
influences close-by galaxies and at the same time contributes to 
the shear signal of background objects, leading to correlations 
among them. 

In two-point statistics, the corresponding intrinsic (II) 
and intrinsic-shear (GI) terms have been subject to de - 
tailed studies both theo retically (e.g. ICatelan et al.l (l200ll): 
ICroft & Metzlerl d2000t); iHeavens et atl (120001); iHui&Zhang l 
(|2002|) : iMackev et al.1 d2002l): Jjind (120021): iHirata & Seliakl 
120041): IHevmans et al l (120061) : IBridle & Abdall al (120071): 
Schneider & Bridle ( 20 l y» a nd observationally (Brown et al. 
2002t IHevmans et al.l l2004t i Mandelbaum et all 120061 I200S 



Hirata et al. 



2007t lFu etal.1 120081: iBrainerd et all I200S 
Okumura et all 120091: lOkumura & Jind 120091) . Although 



the results of these studies show large variations, most of 
them are consistent with a 10% contamination by both II and 
GI correlations for future surveys with photometric redshift 
information. Especially, neglecting these correlations can bias 
the da rk energy equation of state parameter wo by as much as 
50 % (IBridle & Kind l2007t) for a ' shallow" survey described 
in lAmara & Refregierl "( 2*0*07h . For three-poin t shear statistics, 
there have been few measurements up to n ow dBernardeau et al.l 
l2002bl iPen etal.ll200l Uarvis et al.ll2004l) . However the poten- 
tial systematics level in t hese studies is found to be high. A 
recent numerical study by ISemboloni et al.l (120081) showed that 
intrinsic alignments affect three-point weak lensing statistics 
more strongly than at the two-point level for a given survey 
depth. In particular, neglecting GGI and Gil systematics would 
lead to an underestimation of the GGG signal by 5 - 10 % for 
a moderately deep survey like the CFHTLS Wide. Therefore, 



to match the statistical power expected for cosmic shear in the 
future surveys, it is essential to control these systematics. 

The intrinsic alignment, II (III) in the two- (three-) point 
case, is relatively straightforward to eliminate, since it re- 
quires that the galaxies in consideration are physically close 
to each other, i.e. have very similar redshifts and angular 
positions dKing & Schne ider 2002| 120031: IHevmans & Heavensl 
2003; Tak ada & Whitel2004l) . The control of intrinsic-shear sys- 
tematics, GI for the two-point case and GGI in the three-point 
case (Gil also requires that two of the three galaxies are phys- 
ically close and thus can be eliminated in the same way as II 
and III), turns out to be a much greater challenge. However, as 
already pointed out by HS04, the characteristic dependence on 
galaxy redshifts is a valuable piece of information that helps to 
control the intrinsic-shear alignments. 

Several methods for this have already been constructed in 
the context of two-point statistics. T hey can be roughly classi- 
fied i nto three c ategories: modeling (|Kiti g 20051: IBridle & Kind 
20071). nulling (iJoachimi & Schneider! 120081 JS 08 hereafter ; 



Joachimi & Schnei derll2009l) and self^calibration (IZhand f2008; 
Joachimi & Bridle 2009). Modeling separates cosmic shear from 
the intrinsic-shear alignment effect by constructing template 
functions for the latter. It suffers from uncertainties of the model 
due to the lack of knowledge of the angular scale and redshift 
dependence of the intrinsic-shear signal. The nulling technique 
employs the characteristic redshift dependence of the intrinsic- 
shear signal to "null it out". It is a purely geometrical method and 
is model-independent, but suffers from a significant information 
loss. Self-calibration intends to solve the problem of information 
loss by using additional information from the galaxy distribution 
to "calibrate " the signal . The original form of self-calibration, 
proposed by Zhang (2008]), is model-independent but strong as- 
sumptions have been made. Hoachimi & Bridle! (120091) then de- 
velop it into a modeling method, by treating intrinsic alignments 
and galaxy biasing as free functions of scale and redshift. 

All these methods have the potential of being generalized 
to three-point statistics. In this paper we focus on the nulling 
technique, and establish it as a method to reduce the three-point 
intrinsic-shear alignments GGI and GIL Since Gil can be re- 
moved by discardi ng close pairs of galaxies as in the case of 
II controlling (e.g. IHevmans & Heavensll2003l) . we focus on the 
control of GGI systematics. 

As known from the case of two-point statistics, the nulling 
technique introduces significant information loss while (in prin- 
ciple) completely removing the intrinsic-shear alignment from 
the signal. In this work we compare the nulling technique to an 
unconditioned linear compression of the data, distinguish differ- 
ent sources of such information loss, and discuss the possible 
ways of reducing it. We also study the combined constraints on 
cosmological parameters with both two- and three-point cosmic 
shear statistics. 

In Sect. 2 we demonstrate why and how the nulling technique 
can be applied to three-point lensing statistics. We then apply the 
nulling technique to the modeled lensing bispectrum which we 
contaminate by intrinsic-shear alignment. The modeling details 
are described in Sect. 3. The method of nulling weights construc- 
tion and the corresponding results are shown in Sect. 4, while the 
results concerning the constraints on cosmological parameters 
are presented in Sect. 5. We conclude in Sect. 6. 

We will work in the context of a spatially flat CDM cosmol- 
ogy with a variable dark energy whose equation of state w is 
parameterized as w — wq + w a {\ - a), with a the cosmic scale 
factor. The adopted fiducial values for cosmological parameters 
are Q m = 0.3, Q. b = 0.045, Q de = 0.7, w = -0.95, w a = 0.0, 
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h = 0.7, n s — 1.0, and <x 8 = 0.8. Here, Q m , Qb and Qd e are 
the density parameters of the matter (including cold dark matter 
and baryons), baryons and the dark energy at present time, n s 
is the spectral index of the primordial power spectrum of scalar 
perturbations, h is the dimensionless Hubble parameter defined 
by //o = 100/;km/s/Mpc, and cr 8 is the rms mass fluctuation in 
spheres of radius 8/; _1 Mpc. 



2. The nulling technique applied to three-point 
shear tomography 

2. 1 . Principle of the nulling technique 

The shear on the image of a distant galaxy is a result of grav- 
itational distortion of light caused by the inhomogeneous three- 
dimensional matter distribution in the foreground of that galaxy. 
For notational simplicity, we will use the dimensionless surface 
mass density (the convergence) k instead of the shear y as a mea- 
sure for the lensing signal throughout the paper, although in re- 
ality the signal is based on the measurement of the shear. This 
will not affect our results since k and y are linearly related on 
each redshift plane while our method is dealing with the redshift 
dependence of them (the same reason justifies the turning to the 
Fourier domain in the next subsection). 

When one measures the shear y, the direct observable is the 
galaxy ellipticity e t, s = e\ + y. The shear y is a signal caused by 
gravitational distortion which is a deterministic process, where 
the intrinsic ellipticity e\ can be further written as the sum of 
a deterministic part £j det which is caused by intrinsic alignment, 
and a stochastic part e[ an which does not correlate with any other 
quantity. There is no correlation between e[ an of different galax- 
ies either. 

We define k \, s and k\ which are the correspondences of 
ej det + y and e det . We remove the stochastic part since k is deter- 
ministic. Note that k q \, s and k\ are analogs of the dimensionless 
surface mass density k but do not have any direct physical mean- 
ing as k does. They are complex quantities in general and can 
lead to a B-mode signal. To better distinguish the real measur- 
able k from them, we denote it as kq in the rest of the paper since 
it is the physical quantity which is related to the gravitational 
lensing signal. Keeping the dominating li near term, the co nver- 
gence kq can be written as (details see e.g. [Schneider 2006): 



One then uses this weight function as a weight for integrating 
over the source distance: 



3 £l m H 2 

2c 2 



r 

Jo 



dx 



Xs a(x) 



(6) 



where 5 is the three-dimensional matter density contrast, x* is 
the comoving distance of the background galaxy which is acting 
as a source, and a(x) is the cosmic scale factor at the comoving 
distance^ of 6 which is acting as a lens. 

Equation (O clearly shows that the contribution of the mat- 
ter inhomogeneity 6 at comoving distance^, to the cosmic shear 
signal of background galaxies can be considered as a function 
of the source distance Xs> an d this function is proportional to 
1 - XilXs- The nulling technique takes advantage of this char- 
acteristic dependence on source distance Xs by constructing a 
weight function T(xi,Xs) such that the product of T(Xi,Xs) an d 
1 -XilXs has an average of zero on the range between^-, and the 
comoving distance to the horizon Xhor'- 



f^dx s T(xi,Xs) (l-^) = 0. 



kg(Xi>6) : = I dx, T(xi,Xs) k g (0,x s ) ■ 



(8) 



The resulting new measure of shear signal kc(xi, 9) is then free 
of contributions from the matter inhomogeneity at distance^,. 
Note that although the weight function T has two arguments x\ 
and^ s here, we consider it as a function of Xs for a particular^,. 

Consider a correlator (fj, bs *^ bs ) with comoving distances 
Xi < Xj- With a similar decomposition as ([TJi, it is straightfor- 
ward to see that the GI term in it is ^[k^. The term U^j*^) 
vanishes since the lensing signal at Xi is correlated only with 
matter with x ^ Xi' whereas k' y originates solely from physical 
processes happening aiXj- If we integrate (k[k g ) over^ with 

a weight function that eliminates the contributions to k j g by the 
matter inhomogeneity at distance^,, this correlator will also van- 
ish, 



( 'dXjT(Xi,Xj) (*r4) = ' 



(9) 



since it is just the matter inhomogeneity at distance Xi that gives 
rise to the correlation between a-J and k j q . Thus, when we inte- 
grate over (^o bs fo bs ) with the same weight function, the GI con- 
tamination in it will be "nulled out". Equation (f7]i is the condition 
that the weight function T should satisfy in order to "null" the 
intrinsic-shear alignment terms, so we call it "the nulling condi- 
tion". 

The same applies to three-point statistics. Consider a cor- 
relator (fobs^hs^bs) with Xi being the smallest comoving dis- 
tance of the three. Both Gil and GGI systematics contained 
in it also originate from the matter inhomogeneity at distance 
Xt- Typically, the generation of Gil systematics requires that 
Xi ~ Xj < Xk, while the generation of GGI requires Xi < Xj 
andxi < Xk- For both cases, the dependence of Gil or GGI sys- 
tematics onxk is also just 1 - XilXk- So new measures built as 
j^ h " dxk T(Xi,X],Xk) (""ob^obs^obs) with T satisfying the nulling 
condition for three-point statistics 



f 



dXk T(xi,Xj,Xk) 1 



— 1 = 

Xk 



(10) 



(7) 



will be free of both Gil and GGI contamination. Again, 
T(xi,Xj'Xk) here should be seen as a function of Xk whose form 
depends on^, and^. 

Note that this method only depends on the characteristic red- 
shift dependencies of the lensing signal and intrinsic-alignment 
signals, and is not limited to E-mode fields. This is a reassur- 
ing feature since while the kg field is a pure E-mode field to 
first order, the k\ field can have a B-mode component. However, 
if parity-invariance is assumed, any correlation function which 
contains an odd number of B-mode shear components vanishes 
(ISchneiderl2003l) . thus there should be no B-mode component in 
the GGI signal. 

2.2. Nulling formalism for lensing bispectrum tomography 

Since the nulling technique relies on the distinct redshift de- 
pendence of the intrinsic-alignment signal, redshift information 
is crucial for it. With the help of infrared bands, forthcoming 
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multicolor imaging surveys can provide rather a ccurate photo- 
metri c redshift information for the galaxies (e.g. lAbdalla et al.l 
2008; iBordoloi et alj 120101) . allowing tomographic studies of 
cosmic shear statistics. We base our study on cosmic shear bis- 
pectrum tomography, and outline the corresponding formalism 
of the nulling technique in the following. 

Given the galaxy redshift probability distribution of redshift 
bin i which we denote as p$ (z) = Ps(Xs)dXs/dz, one can de- 
fine the average convergence field in redshift bin ; by integrating 
k(0,Xs) in © over p\ (Xs)- We turn to angular frequency space 
now and define 



i»' ( f> 



Jo 



dx%p[ Ofs) kg{{,Xs) 



(11) 



where kq{(,Xs) is the Fourier transform of kg(0,Xs)- To better 
show the relation between ic? (i) and three-dimensional matter 
inhomogeneity in Fourier space 6(k,x), one can write, basing 
on © and ( fTTT i. 



K®(f)= dxW (i) (x)~6(e/x,x) . 
Jo 

by defining a lensing weight function W^(x) as 



dXs p\ 

y 



2a(x)c 2 J x \ 



(13) 



The tomographic lensing bispectrum is defined via 



(*®(A)4%2)^3)) = {2n) z B { ^ G {t u h, h) S D (ti +e 2 + h) , 

(14) 

where the Dirac delta function ensures that the bispectrum is 
defined only when t\, 1%, and (3 form a triangle. This fact arises 
from statistical homogeneity, while that the bispectrum can be 
defined as a function independent of the directions of the angular 
frequency vectors arises from statistical isotropy. 

In a survey, the convergence field £ bs is determined from the 
observed galaxy ellipticities, and the corresponding bispectrum 
B b s suffers from intrinsic-shear alignments. As we did with the 
three-point correlator in Sect.Q] we separate the observed lens- 
ing bispectrum into the four terms: 



Sobs = Bqqq + Z?ggi + Ban + #11 



(15) 



Among them, Z?ggi, #gii and Z?ni can be linked to the conver- 
gence in a similar way as (TBI , for example 

(^1)^2)^3)) = (InfB^tx, h, h) 5 D (tx +t 2 + (3) ■ 

(16) 

Here we assume disjunct redshift bins and let ; to be the red- 
shift bin with the lowest redshift, so (K®(e 1 )K^(( 2 )Z G x \hj) and 

/&®(/'i)&Q' ) (/'2)£p(^3)) both vanish due to the same reason as ex- 
plained in Sect. l2.1l for the two-point statistics. 

The purpose of the nulling technique is to filter fi Q bs in such 
a way that the GGI term is strongly suppressed in comparison 
with the GGG term. The Gil and III terms can be removed by 
ignoring the signal coming from bispectrum B^(£\,{2, (3) with 
two or three equal redshift bins. 

To fulfill this purpose, we construct our new measures as 

N- 

T^Htuh, (3) := £ T (U) (x k ) B^({ u h, (3) X k Az* , (17) 

k=i+l 



where N z is the total number of redshift bins, xl is the deriva- 
tive of comoving distance with respect to redshift, and Azk is the 
width of redshift bin k. The weight function is written now as 
T^'^ixk) since i and j indicate two redshift bins, i.e. two popu- 
lations of galaxies, rather than two comoving distances as in the 
previous subsection. The weight is required to satisfy the 
nulling condition (0 in its discretized form, 



(18) 



m.- j| T {i Hxk) (i-g]^Az, = o, 



k=i+l 



for all j > i. Here, Xi and Xk should be chosen such that they 
represent well the distance to redshift bins i and k. In this paper 
we choose them to be the distances corresponding to the median 
redshift of the bin. The summation over index k runs from i + 1 
rather than i since we consider only bispectrum measures with 
j > i and k > i to avoid III and Gil systematics. In this case B^ 

in ( fT77T > can be written as a sum of B^ c and B^S, and T^ can 
be expressed as 



'GGI' 



(12) Y^(( u ( 2 , t 3 ) = £ T^(x k ) B^ituh, (3) X' k Az* 



k=i+l 



(19) 



k=i+l 



Suppose one has infinitely many redshift bins, then the lens- 
ing signal in bin k caused by the matter inhomogeneity in bin i 
is exactly proportional to 1 ~Xi/Xk> which means S™(A>^2! ^3) 
can be written as a product of 1 —XilXk and some function of the 
parameters other than^: 



B { ^(d , h, € 3 ) = r(Xi,Xj, *i, ^3) 1 1 - — 



Xi 



Xk 



(20) 



Then we have 

N- 



z 

k=i+\ 



T^ix k ) B^gituhMx'k^k = 



(21) 



k=i+l 



This suggests that only the GGG contribution is left in the nulled 
measure Y^'^, the GGI contribution has been "nulled out" due to 
the nulling condition. If only a limited number of redshift bins 
is available, (l20l holds only approximately, leading to a residual 
in (ED- 

Since the nulling condition is the only condition that the 
weight r (,j) must satisfy in order to "null", there is much free- 
dom in choosing the form of it. We would like to further specify 
its form such that it preserves as much Fisher information in 
as possible. The method we have adopted for the nulling weight 
construction will be detailed in Sect.|4] 

Note that for each (/, j) combination, one can in principle 
apply more than one nulling weight to the original bispectrum, 
and obtain more nulled measures. If one retains the condition 
of maximizing the Fisher information and demands that all the 
weight functions built for one (2, j) combination are orthogonal 
to each other, one arrives at higher-order modes that have the 
second-most, third-most, etc., information content (higher-order 
weights, see JS08). The total number of such linearly indepen- 
dent nulled measures for a certain (/, j) equals the possible val- 
ues of k > i + 1. In this schematic study we will only use the 
optimum, i.e. the first-order nulling weights. We will assess the 
information loss due to this limitation in Sect. 15. 41 
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3. Modeling 

3.1. Survey characteristics 

We set up a fictitious survey with a survey size of A = 4000 
deg 2 which is similar to the survey size of DES. This can be 
easily scaled to any survey size using the proportionality of sta- 
tistical errors to A~ l/2 . We assume a galaxy intrinsic ellipticity 
dispersion cr e - cr(e[ an ) = 0.35. As galaxy redshift probabil- 
ity distribution we adopt the frequently used parameterization 
(SmailetalHHl, 



another approach exclusively based on the two-dimensional 
Fourier formalism was constructed, which we use here. 

The bispectrum covariance is a six-point correlation function 
which can be expanded in to its connected parts as outlined in e.g. 
iBernardeau et alj (|2002a). As argued in TJ04, for angular scales 
Z < 3000, the term which is a triple of two-point functions still 
dominates. Keeping only this term, the bispectrum covariance 
reads 



Ps(z) 



exp< 



(22) 



Cov (flg^tfi , h, h), h, hi) 



(2tt) 3 



and use zo = 0.64, a = 2, B - 1.5. The distribution is cut at 
Zmax = 3 and normalized to 1. The corresponding median red- 
shift of this fictitious survey is z m = 0.9, which is compatible 
to a survey like EUCLID. We adopt an average galaxy number 
density « g = 40 arcmin -2 which is again EUCLID-like. 

Disjunct redshift bins without photo-z error are assumed, 
which means that the galaxy redshift probability distribution in 
redshift bin i takes the form p\ l \z) °c p s (z) if and only if the 
redshift that corresponds to comoving distance Xs is within the 
boundaries of redshift bin i. A number of 10 redshift bins are 
used by default. The boundaries of the redshift bins are set such 
that each bin contains the same number of galaxies. 

We adopt 20 angular frequency bins spaced logarithmically 
between f min = 50 and 4, ax = 3000, and denote the charac- A(( U { 2 J3) = 
teristic angular frequency of a bin as {. Within this range the 
noise properties of the cosmic shear field are still not too far in 
the non-Gaussian regime, allowing a more realistic theoretical 
estimation of the bispectrum and its covariance. Whether this 
number of angular frequency bins can reconstruct the angular 
frequency dependence of the bispectrum is tested, and 20 bins 
are found to be sufficient for our requirements on precision. This 
is also expected since the bispectrum is rather featureless as a 
function of angular frequency. 



A til-ils A{ l A( 2 A( i 

x {p (i \€,)P { j' n \h)P (kn \h) 5- eA S hh 8- hh + Sperms.), 



(24) 



in which Alj is the bin width of the angular frequency bin with 
typical value and P^(f) is the observed po wer spectrum 
which contains the intrinsic e llipticity noise (e.g. lKaiserlfl99l 
lHulll999LlJoachirrii et al.ll2008h : 



2tl; 



(25) 



where n, is the galaxy number density in redshift bin i. The term 
A ((1,(2, {3) is defined as 







if \d 
else. 



■ h\ <h<lx+h. 



(26) 



3.2. Bispectrum and its covariance 

We show the modeling of Z?ggg and its covariance in this 
section. We will only consider the tomographic bispectrum at 
redshift bins satisfying z, < zj and zi < Zk, which already ensures 
an elimination of Z?m and Z?gii systematics in our case. 

Applying Limber's equation, it can be shown that the tomo- 
graphic convergence bispectrum can be written as a projection 
of the three-dimensional matter bispectrum Bs(ki,k 2 , ky,x) (see 
e.g. TJ04): 



£3) = 



GGG 1 

/*w W®(x)WU(x)WW(x) 

dx 5 

Jo A 



„ . -1 h h 
Bel— >— >— <X 



(23) 



To compute Bg, we empl oy the fitting formula by 
IScoccimarro & Couchmanl (1200 lh. which is based o n hyper- 
extended perturbation theory ( IScoccimarro & Friemaniri999l) . A 
comparis on of this formula wit h the halo model results can be 
found in lTakada~& Jainl(l2003allbh . 

Estimating the bispectrum covarianc e is often done within 
a flat-sky spherical harmonic formalism (Hu 2000, HuOO here- 
after), which suffers - at least formally - from drawbacks since 
its basis functions Y/ m are only defined f or discrete angu l ar fre- 
quency values and full sky coverage. In iJoachimi et all (2009) 



When \€i - € 2 \ < €3 < €1 + € 2 is satisfied, A^ 1 (€ U { 2 J3) is the 
area of a triangle with side lengths {1,(2 and {■$. We use the 
lEisenstein & Hul £l998) transfer function to evaluate the linear 
three-d imensional matter power spectrum, and the Smith et al. 
(2003) fitting function for the nonlinear power spectrum. 

Note that there is no intrinsic ellipticity noise in the observed 
bispectrum, since the galaxy intrinsic ellipticity distribution is 
assumed to be skewless. 

Under the assumptions of a compact survey geometry and 
scales much smaller than the extent of the survey area, (l24l i 
provides a bispectrum covariance that naturally incorporates the 
scaling with survey size, is not restricted to integer angular fre- 
quencies, and allows for any appropriate binning. In terms of 
Fisher information, the result gi ven by this approac h and the 
HuOO one agree to high accuracy (IJoachimi et alJl2009l) . 

3.3. Toy intrinsic-shear alignment model 

In this section we present a toy model for generating GGI 
systematics. Since the physical generation of intrinsic-shear 
alignments concerns nonlinear growth of structure and complex 
astrophysical processes which are not easy to quantify, a realistic 
model is not yet available. Current simulations involving bary- 
onic matter also have some way to go before they can simulate 
the generation of the GGI systematics reliably. 

Up to now there has not been any attempt to measure GGI 
and Gil in galaxy surveys. ISemboloni et al.l (2008) studied these 
systematics using ray-tracing simulations. They provided fits in 
real space to projected Gil and GGI signals, but the results are 
still too crude to lead to sufficient constraints on an intrinsic- 
shear alignment model. 

This situation emphasizes the importance of a method 
intended to control intrinsic-shear alignment to be model- 
independent, especially at the three-point level. Since this is the 
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case for the nulling technique, for this work we only require a 
simple model for Bq{J which satisfies the characteristic redshift 
dependence and leads to a reasonable bias. 

Based on the observation that the lensing bispectrum ex- 
pression ( 1231 comes directly from (TTZl i and the definition of 
the tomography bispectrum 0141 . we link B^^ also to a three- 
dimensional bispectrum B s ,ss via 



^GGl(A.^2,^3) - 

sn» pfix^ix^ix) D th h h. ^ (27) 



f 

Jo 



AT \x x x 



Similar to B${k\ ,k 2 ,kj,) which is given by 

(S(*i,*)S(*2,Ar)3(*3,*)) = 



(2tt) 3 fo(Jfc, + * 2 + Jfe 3 ) B,(*i , k 2 , HxX 



(28) 



is defined via 



(6,(ki,x)S(k2,x)Mk 3 ,x)) = 



(2tt) 3 6 D {ki + k 2 + k 3 ) B SrS s(kuk 2 , k 3 ;x), 



(29) 



where 6\{k) is the three-dimensional density field which is re- 
sponsible for the intrinsic alignment, and it satisfies 



(30) 



The definition of both and S\ originates from the deter- 
ministic part of galaxy intrinsic ellipticity ^ et . We have as- 
sume the existence of these unde rlying smooth fie l ds. Si milar 
quan tities have been define d injjoachimi & Bridle (2009), see 
also lHirata & Serjakl (12004 and lSchneider & Bridld ?2010). We 
would like to point out again that, although we introduce these 
quantities for the clarity of our model, we do not need them for 
the main purpose of this paper. What we need to model is the 
projected GGI bispectrum B^2. 

Note that in (l27l i. the weight for the lowest redshift bin i is 
the source distribution function p$ which is zero outside red- 
shift bin i, rather than the lensing weight W w which is a much 
broader function. Since depends only on physical processes 
at redshift bin i and is inferred from ellipticity measurements in 
this bin, and Kj? is linked to the three-dimensional matter den- 
sity through the lensing weight this assignment of weight 
functions will ensure the correct redshift dependence of B^^. 

When the redshift bins are not disjunct, however, the intrinsic 
alignment signal can no longer be associated with bin i. There 
will be two permutations in both the left-hand side of (TT~6b and 
the right -hand side of (| 27T>. similar to the two-point case, e.g. 
Eq. 1 1 in lHirata & Seliakl 020041) . 

The modeling of Bs,ss is then a pure matter of choice. We 
build a simple three-dimensional GGI bispectrum with power- 
law dependence on both redshift z and spatial frequency k: 



B Sl 5s {k\,k 2 , h;x) : = B s (k ief , k ref , k Ki ;x(z me d)) 



1+z 

1 + Zmed 



-2 



\k re f 



.2(1-2) 



M-ef 



2(i-2) 



k K f 



Us-2)} 



where z me d is the median redshift of the whole survey, and 3\, 
k K f, r, s are free parameters. Among them the parameter k re f 
is designed to be a characteristic wave number, whose value 
we set to be a weakly nonlinear scale of 10 /iMpc 1 here. The 
minus sign ensures that the contamination of GGI systematics 
leads to an und erestimation of the GGG signal, as found by 
ISemboloni etail (120081) . 

Little is known about the redshift and angular scale depen- 
dence of Bs,ss- However one can roughly estimate how it com- 
pares to the Bsss signal. A linear ali gnment model suggest s 
Si °c (5ii„p(z)/((l + z)D + (z)) (see e.g. iHirata & Seliakl 12004). 
in which p(z) is the mean density of the universe, D + (z) is 
the growth factor, and 5a n is the linear matter density contrast. 
Thus we have, very roughly, 5i oc (1 + ^) 3 5 lin which suggests 
B$,gs x (1 + z) 3 Bssd- The linear alignment model assumes that 
the intrinsic alignment is linearl y rela ted to the local ti dal gra vi- 
tational field (e.g. lCatelan et al.ll200U IHirata & Seliakll2004|) . If 
this holds true, we also expect Bs,ss to have a stronger angular 
scale dependence than Bsss since tidal gravitational interaction 
follows the inverse cube law rather than the inverse square law 
which gravity itself follows. For a ACDM model, in the weakly 
nonlinear regime where perturbation theory holds, the depen- 
dence of Bsss on (1 +z) has a negative power shallower than -4, 
and the dependence on k has a power of around -2. In this paper 
we choose r — 0, s — 1 as default. We also study the cases of 
r = -2, r = 2, and 5 = whose results will be shown in Fig. [4] 
below. 

As for the amp litude 3\ of the GGI sig nal, the only direct 
study up to now is ISemboloni et a l. (2008), which suggests an 
overall GGI/GGG ratio of 10% for a z m = 0.7 survey for el- 
liptical galaxies and few percent for a mixed sample of ellipti- 
cal and spiral galaxies. In this paper we adjust such that the 
amplitude of the tomographic GGI bispectrum is limited to be 
within 10% of the amplitude of the lensing GGG signal, i.e. 
GGI/GGG< 10 % at redshift bin combinations with Zi <s Zj and 
Zi <k Zk where the GGI signal is expected to be most signifi- 
cant. This leads to a relatively modest overall GGI/GGG ratio 
at percent level. We will show examples of the generated GGI 
and GGG signals in Fig. [3] As an order-of-magnitude estimate, 
one can also relate the GGI/GGG ratio to that of GI/GG by ex- 
panding three-point signals to couple s of two-point signals using 
pertur bation theory, in analogy to the lScoccimarro & Couchmanl 
(2001) fitting formula. For the case of z, «; z; ~ Zk, the leading 
order terms would give that the GGI/GGG ratio approximates 
that of GI/GG evaluated at redshifts z, and Zj- This suggests that 
our adopted GGI/GGG ratio is also con sistent with available ob- 
serva t ional studies of th e GI signal {M andelbaum et al. 2006, 
2009 : IHirata et all 120071: iFu et alJ 120081: lOkumura et al J 12009: 
lOkumura & Jing|2009l) . although the results of these studies vary 
a lot according to different median redshift, color and luminosity 
of the selected galaxy sample. 



4. Construction of nulling weights 



(31) 



As mentioned in Sect. 12.21 we would like to construct a sin- 
gle first-order weight function T^'^(x) for each (/, j) combination 
which preserves the maximum of information. This can be seen 
as a constrained optimization problem. The constraining condi- 
tion here is the nulling condition and the quantity to be optimized 
is the Fisher information after nulling. In JS08, several practical 
methods were developed to solve this optimization problem at 
the two-point level, and very good agreement was found among 
the different methods. 
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Fig. 1. Distribution of the nulled Fisher information as defined in ( f40b per (4,4, 4) bin and per redshift bin combination among 
different angular frequency triangle shapes and sizes. Results for four redshift bin combinations (2, j) are presented. Left panel: 
Distribution of the nulled Fisher information among different triangle configurations. We consider triangles with the common 
shortest side length 4 - 171 which corresponds to the 7th angular frequency bin. Due to our logarithmic binning and the constraint 
that the three side lengths must be able to form a triangle, only 8 such triangle configurations exist. Plotted is the nulled Fisher 
information contained in these 8 triangles against a, which is the angle opposite to the shortest side length in that triangle. Smaller 
a correspond to more elongated triangles, and larger a correspond to almost equilateral triangles. Right panel: Distribution of the 
nulled Fisher information contributed by each (4,4, 4) bin over different triangle sizes. A fixed triangle shape with t\ : 4 ■ 4 = 
1 : 3.64 : 4.52 (corresponds to the leftmost points in the left panel) is chosen. The nulled Fisher information contained in one 
(4,4, 4) bin is plotted against the shortest side length l\ of each triangle. 



We adopt the simplified analytical approach as described in 
JS08, and reformulate it for three-point statistics here. For con- 
venience we introduce the following notations: 
the bispectrum covariance matrix CovB, whose elements are 

(32) 

the covariance matrix CovY of the nulled bispectra Y, whose 
elements are 



covy(| Ai v tut) Cov ( y(W (4, 4, 4), y {lm \u, 4, 4)) 

= Z Z C ° V (O^. 4, 4)) 

k= "= 

i+1 w 

x T (i Hxk)T {bn) {j(n)x' k x'n^ k Az„; 



(33) 

a vector B >A , whose elements are partial derivatives of the bispec- 
trum with respect to the cosmological parameter p M 



iiAA ' dp M 



(34) 



and a corresponding vector Y J(U for nulled bispectra Y, whose 
elements are 



(35) 



Then the Fisher information matrix from the original bispec- 
tra can be written as (following TJ04) 



F; v = B^ CovB- 1 B, v , 
and that from the nulled bispectra can be written as 



Y „ CovY" 1 Y 



(36) 



(37) 



Here the matrix multiplication is a summation of all possi- 
ble angular frequency combinations (4,4,4) and redshift bin 
combinations, (ijk) for the original bispectra and (if) for the 
nulled bispectra. In ([36j» and d37j, CovB 1 and CovY 1 indi- 
cate the inverse of the covariance matrix. When the covari- 
ance is approximated by triples of power spectra, the covari- 
ance between two different angular frequency combinations 
(4,4,4) & (4,4,4) is zero, see (l24l i. which means that the 
covariance matrix is block diagonal. In this case the matrix in- 
version can be done separately for each block specified by an 
angular frequency combination (£1,(2, 4)- 

According to the idea of the simplified analytical approach, 
we consider the Fisher information on one cosmological parame- 
ter contained in bispectrum measures B^L{£\ , 4, 4) with a sin- 
gle (4,4, 4) combination and with redshift bin (i, j, k) combi- 
nations having common (/, j) indices. For every (i, j) combina- 
tion we build nulling weights T ( '^ which maximizes the nulled 
Fisher matrix using the method of Lagrange multipliers. Since 
here the nulled Fisher matrix receives contribution only from 
certain angular frequency and redshift combinations, we denote 
it as Fq to avoid ambiguity. F^ has only one component since 
only one cosmological parameter is taken into consideration. As 
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only a single (€1,^2,^3) combination is involved, we will omit 
the ^-dependence in all variables in the rest of this subsection to 
keep a compact form. 

Again for notational simplicity, we follow JS08 and intro- 
duce a vector notation as follows. For each (i, j) in considera- 
tion, let the values of the weights T^'ixk) form a vector T - T^, 
and define another vector p and a matrix C with elements 

Pk := Bj' k )x' k Az* , (38) 

C kn := CovB(* ijn ) X ' k Xn Az* Az„ . (39) 
Thus Fq can be expressed, according to (|37T >. as 



Fi' 7) := Y I(U ('>) CovY-'C^; «) Y jjU ( y ) = ^g^. (40) 
We further define a vector / with elements 

A -(1 <«> 

to write the nulling condition (TT8l as 

- x -f = . (42) 

The problem of finding nulling weights T which maximize 
Fg J ^ under the constraint given by the nulling condition can be 
solved with the method of Lagrange multipliers by defining a 
function 

G := F ( i j) + AO m = (T "_ P) + AT ■ f (43) 

t t ct 

with A being the Lagrange multiplier, and setting the gradient of 
G with respect to T to zero, 

V r G = 2p V-LEL -2CT (t—J^) + Af = . (44) 

r T cr \t t ct) 

The solution to this equation is (for more details see JS08) 

r = Ar{c-V-^§^c- 1 /} . (45) 

with the normalization N adjusted to give |!T| 2 = 1. 

Apparently the thus constructed nulling weights depend on 
which (€1,^2,^3) combination is considered and with respect to 
which cosmological parameter we optimize the information con- 
tent. In this paper the default cosmological parameter to optimize 
is Q m , and we choose for each combination the (2, £3) 
combination which maximizes F„ . However one needs to be 
aware that this serves only as a clear choice of a (€1, €2, £3) com- 
bination and is not necessarily the best in terms of information 
preservation considering all angular frequency bins and all cos- 
mological parameters. 

To show which triangle shapes and sizes contain more in- 
formation, we plot F„ against the (C\, (2, (3) triangle shape and 
size for four typical (/, f) combinations in Fig.Q] In the left panel, 
the nulled information F^ contained in different triangles with 
a common shortest side length i\ = 171 is plotted against a, 
which is the angle opposite to ~t\ . Due to our logarithmic bin- 
ning in angular frequency, only eight (^1,^2,^3) combinations 




0.0 0.5 1.0 1.5 2.0 2.5 3.0 

z 

Fig. 2. Nulling weights T (l ' ] for redshift bins i = 1, j = 2 are 
plotted against the redshift value of the third redshift index k. 
Remarkable consistency is found between nulling weights opti- 
mized on different parameters, shown with different line styles. 

with l\ = 171 can form triangles. One sees that the more elon- 
gated triangles (small a) contain much more Fisher information 
than the almost equilateral triangles (large a). The small separa- 
tion between the 3rd and the 4th points from the left is caused by 
the degeneracy of different triangle shapes with respect to a, e.g. 
two equal and very long side lengths can result in the same value 
of a as two shorter side lengths with a length difference close to 
the length of the shortest side length. The right panel shows the 
distribution of the Fisher information contained in one (i\ , (2, ^3) 
bin over the triangle size. When the redshift in consideration is 
higher, the peak of the information distribution moves to higher 
angular frequencies. The figure suggests that most information 
comes from high redshifts and small angular scales. 

To explore the sensitivity of nulling weights on the choice of 
the cosmological parameter, we construct seven sets of weight 
functions, each optimizing the information content in terms of 
one parameter. For all (2, j) combinations we find that the nulling 
weights are not very sensitive to the choice of parameter. As an 
example, the weights for (z, j) = (1,2) are shown in Fig. [2] This 
result is rather surprising at first sight, since for different parame- 
ters the distribution of information (contained in the bispectrum) 
over redshift bins is quite different. However, such insensitivity 
suggests that the shapes of nulling weights are already strongly 
constrained under our construction scheme. One constraint is, 
evidently, the nulling condition. Moreover, considering the fact 
that we optimize the nulling weights for each (z, j) combination 
with respect to the information content they preserve, we have 
already required the shapes of these first order nulling weights 
to be as smooth as possible. 

The fact that these two conditions have already imposed 
strong constraints on the nulling weights also suggests that 
nulling weights can be robustly and efficiently constructed, i.e. 
it is not critical to construct the "best" nulling weights. 

5. Performance of the nulling technique 

5.1. GGI/GGG ratio 

What the nulling technique "nulls" is the GGI signal Z?ggi, 
so the GGI/GGG ratio is the most direct quantification of its per- 
formance. We plot the modeled GGI and GGG bispectra before 
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Fig. 3. Tomographic convergence bispectrum (GGG, solid curves) and intrinsic-shear alignment (GGI, dashed curves) for equilateral 
triangles are plotted against triangle side length. Measures both before (left panel) and after (right panel) applying the nulling 
technique (B and Y respectively) are shown for three typical redshift bin (i, j) combinations in the three rows. 



and after nulling in Fig. [3] The original GGI signal is shown in 
the left panels by dashed lines. For comparison the GGG signals 
are shown as solid curves. The results are shown for equilateral 
triangle configurations for the convenience of presenting. One 
sees that when the redshift bin number j and/or k increase, the 
changes in GGG and GGI signals are different, which shows the 
expected different redshift dependence. For all redshift bin com- 
binations the GGI signal is modeled to be subdominant to the 
GGG signal. In the nulled measures shown in the right panels, 
the GGI/GGG ratio is suppressed by a factor of 10 over all an- 
gular scales, which reflects the success of the nulling technique. 



Thus, to calculate the full information content, we use general 
triangle configurations but limit our calculation to triangles with 
three different side lengths, again for reasons of simplicity (for 

details see AppendixlAl). 

We will use the figure of merit (FoM. lAlbrecht et al.|[2006l) 
to quantify the goodness of parameter constraints. Here the FoM 
for constraints in the parameter plane p a - pp is defined to be 
proportional to the inverse of the area of the parameter constraint 
ellipses: 

FoM(p„, P/? ) = ((F-'WtF-'V - (F- 1 )^) 4 . (46) 



5.2. Information loss and downweighting of systematics 

We further evaluate the performance of the nulling technique 
by looking at the constraining power of cosmic shear bispectrum 
tomography on cosmological parameters, as well as the biases 
caused by the GGI systematics before and after nulling. 

The full characterization of the bispectrum involves three an- 
gular frequency vectors which form a triangle. In some works 
concerning three-point statistics, only equilateral triangle con- 
figur ations i.e. l\ - (2 — ^3 = t are used for simplicity reasons 
(e.g. lPires et al. 2009). But as several authors have pointed out 



(e.g. Kilbinger & Schneider 2005; Berge et al. 2010), only a low 
percentage of information is contained in equilateral triangles. 



To compute biases, we adopt a method b ased on a simple ex- 
tension of the Fisher matr ix formalism (e.g. lHuterer et alj r2006; 
lAmara & Refregier| [2008). Then one needs to define a bias vec- 
tor B GGI which in our case reads: 



with 



iGGI 



= Bggi CovB- 1 B 



YccnCovY- 1 Y 



B ° GI &/3 ) : = 



# GGI (£l, 



-2, <-3) 



(47) 



(48) 



(49) 
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(50) 



The bias of the parameter estimator p M is given by the differ- 
ence between its ensemble average and the fiducial value of the 



parameter p^f: 



,GGI 



(51) 
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The information content before and after nulling can be seen 
in Fig. [6] On the cost of increasing the error on each parameter 
to about twice its original value, GGI systematics are reduced to 
be within the original statistical error. The relative information 
loss in terms of FoM can be found in TableQ] The constraints 
shown in Fig.|6]do not represent the best constraints obtainable 
from a cosmic shear bispectrum analysis since we consider only 
the triangles with angular scale i\ + I2 + ^3- Also note that the 
nulling technique can in principle remove the GGI systematics 
completely. But as shown in Fig. [(J the systematics still cause 
some residual biases on cosmological parameters after nulling, 
due to the finite number of redshift bins. The GGI systematics 
will be reduced to a lower level when more redshift bins are 
available. We will discuss this further in the following subsec- 
tion. 



5.3. How many redshift bins are needed? 

Analyzing the cosmic shear signal in a tomographic way 
was originally meant to maximize the information. For t his pur - 
pose alone, a crude redshift binning will suffice <|Hu|[l999). 
However, to control intrinsic-shear alignment, which is a 
redshift-dependent effe ct, much more detailed redshift infor- 
mation is required (e.g.lKing & Sc hneider 2002; iBridle & King! 
l2007tljoachimi & Schneiderl2008l) . Thus, for a method intended 
to eliminate intrinsic-shear alignment, it is necessary to show its 
requirement on the redshift precision. In the case of nulling, de- 
tailed redshift information is not only needed for the method to 
be able to eliminate the bias, but also for the preservation of a 
reasonably large amount of information through the nulling pro- 
cess. JS08 examined the number of redshift bins required for 
the nulling technique in the two-point case, and showed that 
10 redshift bins already ensure that parameters are still well- 
constrained after nulling. 

To re-assess this problem at the three-point level, we con- 
sider two different situations to address the requirements coming 
from control of the intrinsic-shear alignment and preservation of 
the information content separately. In both cases we split the red- 
shift range between z = and z — 3 into 5, 10, 15, and 20 (only 
in the first situation) redshift bins, with the redshift bins split in 
a way that there is an equal number of galaxies in each bin. 

First we consider a single cosmological parameter, D. m , to be 
free and study the biases introduced by the GGI signal on Q m 
both before and after nulling. We use only equilateral triangle 
configurations to reduce the amount of calculation. The results 
are shown in Fig. [4] Within the range of consideration, the ratio 
of the nulled and the original biases drops quickly with the in- 
crease of the number of redshift bins for all GGI models. For 
most of the models, 5 redshift bins seem to be not sufficient 
for the nulling technique to control the bias induced by GGI 
down to a percent level. Going from 5 redshift bins to 10 red- 
shift bins is very rewarding in terms of bias reduction. However, 
we note that a decrease |bf/bi| doesn't neccessarily indicate a 
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Fig. 4. Ratio of the nulled and the original biases for cosmolog- 
ical parameter Q m as a function of number of redshift bins N z . 
Results for different GGI models are shown. Parameter s and r 
are the slopes of angular frequency and redshift dependence of 
our power-law model (13 It . 
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Fig. 5. Figure of merit (FoM) as defined in d46i > in the Q m - cr g 
plane as a function of number of redshift bins N z . FoM from 
two-point (2p) measures, three-point measures (3p) and com- 
bined (2p+3p) are shown both before nulling (original) and after 
nulling (nulled). 



better performance of the nulling method, or generally speak- 
ing, of any method intended to control the intrinsic-shear align- 
ment. One can see the reason for this by noticing that, it is the 
original unbinned GGI/GGG signal that is directly controlled by 
any of these methods. Between [bf/bi| and the original unbinned 
GGI/GGG signal lies the binning process as well as the sum- 
mation over angular frequency bins and redshift bins. Since the 
signs of the biases contributed by different angular frequencies 
and redshifts can be different, there can be bias cancellation dur- 
ing these processes. In another word, |bf/b;| can depend on bin- 
ning choices. 

We then vary two cosmological parameters (Q m and <x 8 ) and 
investigate how the original and the nulled parameter constraints 
change with respect to the number of redshift bins available. For 
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Fig. 6. Projected 1-sigma 
(68 % CL) parameter con- 
straints from cosmic shear 
bispectrum tomography. 
Hidden parameters are 
marginalized over. The 
black solid and blue dash- 
dotted ellipses correspond 
to the original constraints 
and those after nulling, 
respectively. The black cross 
in the center of each panel 
represents the fiducial values 
adopted for the parameters, 
and the distance from the 
center of one ellipse to 
the black cross reflects the 
bias caused by intrinsic- 
alignment GGI systematics 
on the corresponding param- 
eter. As nulling can be seen 
as a linear data compression 
under the constraint of the 
nulling condition, we also 
plot the constraints and bi- 
ases after an unconditioned 
linear data compression as 
magenta dashed ellipses for 
comparison (see Sect. l5.4b . 



this case we use all triangle shapes to enable a comparison with 
results for two-point statistics. 

Our result (Fig. [5]) shows that a further increase of the num- 
ber of redshift bins beyond 10 is not very rewarding in terms 
of information preservation as characterized by the FoM, in ei- 
ther 2p, 3p, or 2p+3p cases. This suggests, when the possibility 
of more redshift bins exists, the choice of redshift bin number 
should be based mainly on the requirement of bias reduction 
level in case of negligible photometric errors. When there are 
non-negligible photometric errors, howev er, the information loss 
will pr obably be more severe, as found bv lJoachimi & Schneider! 
(2009) for the two-point case. 

5.4. The nulling technique as a conditioned compression of 
data 

The necessity of carrying out data compression in cosmol- 
ogy has long been recognized (e.g. lTegmark et all 19971) and has 
been ever increasing due to the increasing size of the data sets. 
In cosmic shear studies the survey area of next generation multi- 
color imaging surveys will be an order of magnitude larger than 
the current ones. The study of three-point statistics also implies a 
huge increase in the amount of data directly entering the Fisher- 
matrix/ likelihood analysis, compared to the two-point case. 

The basic principle of data compression is to reduce the 
amount of data while preserving most of the information. This is 
already naturally encoded in the nulling technique. If one keeps 
only the first-order weights for nulling, as we do in this paper, 
the nulling procedure reduces the number of data entries in each 
angular frequency bin from the number of redshift bin (z, j, k) 
combinations, to the number of (/, j) combinations, which means 
roughly from A^ 3 to A^ 2 . The nulling transformation is linear 



Table 1. Change of cosmic shear bispectrum statistical power 
after nulling (null) and linear data compression (compress). 
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null 


null/i 


compress 


compress/i 




21455 


4609 


21.5% 


12242 


57.1% 


fim-Wo 


637 


123 


19.3 % 


428 


67.2 % 




145 


33 


23.0 % 


110 


75.9 % 


0"8" WO 


434 


87 


20.0 % 


299 


68.9 % 


0"8" W a 


101 


26 


25.4 % 


72 


71.3% 


w - w a 


11.4 


2.3 


20.2 % 


8.0 


70.2 % 



Notes. Presented are FoM on two-dimensional parameter planes be- 
tween cosmological parameters fl m , cr 8 , w and w a . The cosmological 
parameters h, Sit, and n s are marginalized over. The second column is 
the FoM from the original bispectrum; the third and fifth columns are 
FoM from the nulled and the compressed measures, respectively; the 
fourth (sixth) column shows the percentage of the third (fifth) column 
compared to the first column, which reflects the relative information 
loss through the nulling (the unconditioned compression) procedure. 



since the resulting nulled entry is a linear combination of k orig- 
inal entries weighted by the nulling weight ( fTTl l. In the sense that 
an "optimum" set of nulling weights is constructed, the nulling 
technique also intends to preserve as much information as pos- 
sible. But there is yet another additional constraining condition 
in the nulling procedure: the nulling condition (0, which largely 
confines the shape of the nulling weights by requiring the ex- 
istence of at least one zero-crossing (see Fig.|2]l. In short, the 
nulling technique can be seen as a conditioned linear compres- 
sion of data. 

It is then interesting to know how much of the information 
loss during the nulling process actually comes from the nulling 
condition, and how much just comes from the fact that a data 
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Fig. 7. The thick green 
(gray) solid, thick blue 
(black) solid and thin black 
dashed ellipses indicate 1- 
sigma (68 % CL) parameter 
constraints from the nulled 
power spectrum measures, 
bispectrum measures, and 
combined. Hidden parame- 
ters are marginalized over. 
The distance from the center 
of an ellipse to the black 
cross reflects the nulled bias 
on the corresponding pa- 
rameter. The original biases 
from bispectrum measures 
can be seen in Fig|6] The 
thin black solid ellipses 
over-plotted on to the cen- 
ters of the nulled combined 
constraint ellipses indicate 
the statistical power (68% 
CL) of combined constraints 
before nulling. Note the dif- 
ferent ranges of parameters 
compared to Fig|6] 



Table 2. FoM before ('/') and after ('/') nulling and their ratio, using the cosmic shear power spectrum (2pt), bispectrum (3pt), and 
combined (2pt+3pt) analysis. 
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compression process is naturally involved in nulling. To explore 
this, we perform an unconditioned linear data compression, by 
simply ignoring the nulling condition in the whole nulling pro- 
cedure i.e. dropping the Lagrange multiplier term in ( 1431 , but 
otherwise keeping the simplifications inherent to the analytical 
approach. The results are shown in Fig. [6] A summary of the 
FoM from the original and the nulled bispectrum measures as 
well as the compressed measures is shown in TableQ] 

In contrast to nulling, an unconditioned linear compression 
does not eliminate the parameter bias, but increases or reduces 
some of them marginally. Regarding the parameter constraints, 
although the increase in the size of the ellipses is much less than 
in the case of nulling, around one third of the information in 
terms of FoM is lost through compression, which means that 
the amount of degradation in parameter constraints after com- 
pression is not negligible. This suggests that keeping only the 
first-order terms contributes to non-negligible information loss. 
To regain part of this information, one could add higher-order 
weights to the nulling procedure. But the difference between the 
nulled and the compressed FoM serves as an indication for the 
inevitable information loss through the nulling process, which is 
imposed by the nulling condition. 



5.5. Two-point and three-point constraints combined 

Besides constraining cosmological parameters using three- 
point cosmic shear alone, we investigate the combined con- 
straints from both two-point and three-point cosmic shear mea- 
sures. The performance of the nulling technique on cosmic shear 
power spectrum tomography alone and the resulting constraints 
on cosmological parameters were presented in JS08. For con- 
sistency, we use the same setting for the cosmic shear power 
spectrum as described for the bispectrum in Sect. [3] In partic- 
ular, we neglect photometric redshift errors, use only a limited 
range and number of ^-bins, and adopt a power-law intrinsic- 
shear alignment model with a form described by (36) in JS08 
and a slope of 0.4. We have confirmed the consistency between 
our power spectru m and bispectrum codes with those used in 
iBerge et aU (l2010 | ). Our powe r spectrum code agrees also with 
iCosmo dRefregier et alj|2008l) . 

Figure |7] shows the resulting constraint ellipses after nulling 
from the cosmic shear power spectrum analysis, the bispectrum 
analysis, and the two combined. To show how much information 
is lost during the nulling process, we overplot the original two- 
and three-point combined constraints on top of the nulled con- 
straint ellipses in Fig. [7] but center them on the corresponding 
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nulled constraints by subtracting the bias difference before and 
after nulling. The information content in terms of FoM for each 
parameter pair is presented in Table[2] 

One sees that the amount of information contained in bispec- 
trum measures and power spectrum measures are indeed compa- 
rable. With bispectrum information added, typically three times 
better constraints in terms of FoM are achieved, both before and 
after nulling. This factor is smaller than the result in TJ04, al- 
though the same angular frequency range and the same set of 7 
cosmological parameters are chosen for both studies. However 
a direct comparison is prohibited by different fiducial values 
adopted and different survey specification. 

Through the nulling procedure, around 15 % of the origi- 
nal information in terms of FoM is preserved in the two-point 
case, and around 20 % in the three-point case. It is a bit higher 
in the three-point case, in accordance to the fact that a roughly 
A^ 3 — > N z 2 compression is involved in the three-point case and a 
A^ 2 — > N z l one in the two-point case, while this fact is due to the 
summation over one redshift bin index during the nulling proce- 
dure (the same trend is evident in Fig. [5]). The information loss 
is considerable, but it is a price to pay for a model-independent 
method. As we have discussed in the previous subsection, the 
difference between the information loss through the nulling and 
the unconditioned compression procedures represents the in- 
evitable loss of information through nulling. However, this dif- 
ference is less than 50 % in the considered three-point case. The 
other information loss is due to the simplifications we adopted 
in this study, including using only the first-order weights, and 
discarding the measures with two or three equal redshift bins. A 
further detailed consideration of these aspects can regain part of 
the lost information. Another simplification we have made in the 
three-point case is to use only triangles with three different an- 
gular frequencies. This reduces both the original and the nulled 
information contained in the three-point measures. However, this 
simplification can be easily removed with a careful distinction of 
all cases. 

Also notice that, the dependence of number of possible bis- 
pectrum modes, i.e. triangles, on the maximum angular fre- 
quency £ max is roughly l^.^, while that of power spectrum modes 
is roughly { l m . ix . For this study £ m - dx = 3000 is chosen. If reliable 
information on smaller angular scales can be obtained, the three- 
point statistics will possibly give us more information than the 
two-point statistics. 

6. Conclusion 

In this study we developed a method to control the intrinsic- 
shear alignment in three-point cosmic shear statistics by gener- 
alizing the nulling technique. We showed that the generalization 
of the nulling technique to three-point statistics is quite natural, 
providing a model-independent method to reduce the intrinsic- 
shear alignment signals (GGI and Gil) in comparison to the lens- 
ing GGG signal. 

To test the performance of the nulling technique, we as- 
sumed a fictitious survey with a setup typical of future multi- 
color imaging surveys, and applied the nulling technique to the 
modeled bispectra with intrinsic-shear alignment contamination. 
The lensing bispectra (GGG) was computed based on pertur- 
bation theory, while the GGI signal was modeled by a simple 
power-law toy model. We focused on the reduction of the GGI 
contaminant, since Gil can be removed simply by not consider- 
ing tomographic bispectra with two or three equal redshift bins. 

The reduction of the intrinsic-shear alignment contamination 
at the three-point level by the nulling technique was demon- 



strated both in terms of the GGI/GGG ratio, and in terms of bi- 
ases on cosmological parameters in the context of an extended 
Fisher matrix study. In terms of the GGI/GGG ratio, a factor of 
10 suppression is achieved after nulling over all angular scales. 
Correspondingly, the biases on cosmological parameters are re- 
duced to be less than or comparable to the original statistical er- 
rors. We studied the performance of the nulling technique when 
5, 10, 15, or 20 redshift bins are available, and found that the per- 
formance on bias reduction, rather than how much information 
is preserved during the nulling procedure, depends more signifi- 
cantly on the number of redshift bins. In case one requires better 
control of intrinsic-shear alignment, more detailed redshift in- 
formation allowing more redshift bins is the most direct way to 
go- 
When dealing with real data, there is one further source 
of complication which we did not consider in this paper, 
that is the photometric redshift uncertainty. The photomet- 
ric redshift uncertainty can be characterized by a redshift- 
dependent photometric redsh ift scatter and catastrophic outliers. 
iJoachimi & Schneider! (|2009) studied the influence of photomet- 
ric redshift uncertainty on the performance of the nulling tech- 
nique at the two-point level. They found that the photometric 
scatter places strong bounds on the remaining power to con- 
strain cosmological parameters after nulling. The existence of 
catastrophic outliers, on the other hand, can lead to an incom- 
plete removal of the intrinsic (II, III) alignments as well as the 
intrinsic-shear alignments (GI, Gil, GGI). However, methods to 
control the photometric redshift uncertainty have been proposed. 
For example, recent studies concerning the problem of catas- 
trophic outliers point to the solutions of either limiting the lens- 
ing analysis to z < 2.5 or b y conducting an additional small- 
scale spectroscopic survey dSun et al.l20 09: Bern stein & Hutererl 
120091: iBordoloi et al.ll20Toh . 

As already demonstrated by JS08 in the two-point case, some 
information loss is inherent to the nulling procedure. For the 
setup of this paper we found that, in terms of FoM about 20 % of 
the original information is preserved through the nulling proce- 
dure in the three-point case, and 15 % in the two-point case. We 
further studied the source of such information loss by comparing 
the nulling technique to an unconditioned linear compression of 
the data, since the nulling procedure can be seen as a linear com- 
pression of data under the constraint of the nulling condition (0. 
We found that around one third of the original information is lost 
through an unconditioned compression of the data, suggesting 
that this situation can be improved by considering higher-order 
terms in the nulling and compression processes. 

Results on parameter constraints from the two- and three- 
point cosmic shear statistics combined are also presented. The 
amount of nulled information contained in bispectrum measures 
and power spectrum measures are comparable. With bispectrum 
information added, typically three-times better constraints are 
achieved both before and after nulling, in terms of FoM. 

Again, due to the large amount of information existing in the 
three-point cosmic shear field, one would certainly like to exploit 
it in the future. The nulling method we developed in this work 
solves a potentially severe problem hampering the use of three- 
point information, namely the intrinsic-shear alignment system- 
atic. Our method works at the cost of a large information loss, 
which can hopefully be avoided by a future method of removing 
the intrinsic-shear alignment contaminants. But as the only com- 
pletely model-independent method so far, the nulling technique 
can serve as a working method now and can provide a valuable 
cross check even with the availability of better methods. 
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Appendix A: Counting of triangles 

A triangle is specified by six indices, i.e. three redshift bin 
indices {i, j, k] and three angular frequency bin indices l\, ti, {3. 
To ensure that we count each triangle configuration only once, 
we set the condition that t,\ <li <€■>,. Moreover, we would like 
the first index among [i, j, k] in (l36l l to have the lowest redshift, 
i.e. Zi < Zj and zi < Zk, for the convenience of performing the 
nulling technique. The possible [i, j, k] combinations under these 
constraints in the case of /V; = 4 are listed in Fig. IA.ll 



ijk ijk ijk 




Fig.A.l. List of possible triangles (redshift bin combinations) 
with condition z, < zj and z, < zu when 4 redshift bins are avail- 
able. An angular frequency combination satisfying l\ < I2 < It, 
is chosen. Note that the redshift indices and the angular frequen- 
cies are linked in pairs due to the definition of the tomographic 
bispectrum (TBI . In this paper a default of 10 redshift bins is as- 
sumed. 

However, setting both conditions is problematic. Inspecting 
the definition of the tomographic bispectrum i 141 . one sees that 
the redshift indices and the angular frequencies are linked in 
pairs, e.g. convergence k in redshift bin i has angular frequency 
l\ , which is not desirable since the smallest angular scale does 
not necessarily correspond to the lowest redshift. To solve this 
problem, we perform nulling three times for each general an- 
gular frequency combination with £\ < < swapping the 
redshift-angular scale correspondence in-between, thus allowing 
each redshift to be able to correspond to any angular frequency. 

Note that the situation complicates a bit when two of the 
angular frequencies are equal, since then the swapping may lead 
to exactly the same configuration. To avoid this, we will restrict 
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ourselves to three different angular frequencies. This can exclude 
a high percentage of possible configurations. In our case, i.e. 20 
logarithmically spaced bins between ( m { n = 50 and { m . dx = 3000, 
37 % of the angular frequency combinations which can form a 
triangle have been excluded. However, this is only a technical 
complication which can be solved with a careful distinction of all 
cases. Since this paper is intended to be a proof of applicability 
of the nulling technique to three-point statistics, we defer the 
intricacies of accounting for all triangle configurations to future 
work. 
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